13 research outputs found

    Missing values estimation for skylines in incomplete database

    Get PDF
    Incompleteness of data is a common problem in many databases including web heterogeneous databases, multi-relational databases, spatial and temporal databases and data integration. The incompleteness of data introduces challenges in processing queries as providing accurate results that best meet the query conditions over incomplete database is not a trivial task. Several techniques have been proposed to process queries in incomplete database. Some of these techniques retrieve the query results based on the existing values rather than estimating the missing values. Such techniques are undesirable in many cases as the dimensions with missing values might be the important dimensions of the userโ€™s query. Besides, the output is incomplete and might not satisfy the user preferences. In this paper we propose an approach that estimates missing values in skylines to guide users in selecting the most appropriate skylines from the several candidate skylines. The approach utilizes the concept of mining attribute correlations to generate an Approximate Functional Dependencies (AFDs) that captured the relationships between the dimensions. Besides, identifying the strength of probability correlations to estimate the values. Then, the skylines with estimated values are ranked. By doing so, we ensure that the retrieved skylines are in the order of their estimated precision

    A model for computing skyline data items in cloud incomplete databases

    Get PDF
    Skyline queries intend to retrieve the most superior data items in the database that best fit with the userโ€™s given preference. However, processing skyline queries are expensive and uneasy when applying on large distributed databases such as cloud databases. Moreover, it would be further sophisticated to process skyline queries if these distributed databases have missing values in certain dimensions. The effect of data incompleteness on skyline process is extremely severe because missing values result in un-hold the transitivity property of skyline technique and leads to the problem of cyclic dominance. This paper proposes an efficient model for computing skyline data items in cloud incomplete databases. The model focuses on processing skyline queries in cloud incomplete databases aiming at reducing the domination tests between data items, the processing time, and the amount of data transfer among the involved datacenters. Various set of experiments are conducted over two different types of datasets and the result demonstrates that the proposed solution outperforms the previous approaches in terms of domination tests, processing time, and amount of data transferred

    Data backup and recovery with a minimum replica plan in a multi-cloud environment

    Get PDF
    Cloud computing has become a desirable choice to store and share large amounts of data among several users. The two main concerns with cloud storage are data recovery and cost of storage. This article discusses the issue of data recovery in case of a disaster in a multi-cloud environment. This research proposes a preventive approach for data backup and recovery aiming at minimizing the number of replicas and ensuring high data reliability during disasters. This approach named Preventive Disaster Recovery Plan with Minimum Replica (PDRPMR) aims at reducing the number of replicationsin the cloud without compromising the data reliability. PDRPMR means preventive action checking of the availability of replicas and monitoring of denial ofservice attacksto maintain data reliability. Several experiments were conducted to evaluate the effectiveness of PDRPMR and the results demonstrated that the storage space used one-third to two-thirds compared to typical 3-replicasreplication strategies

    Disaster recovery in cloud computing systems: an overview

    Get PDF
    With the rapid growth of internet technologies, large-scale online services, such as data backup and data recovery are increasingly available. Since these large-scale online services require substantial networking, processing, and storage capacities, it has become a considerable challenge to design equally large-scale computing infrastructures that support these services cost-effectively. In response to this rising demand, cloud computing has been refined during the past decade and turned into a lucrative business for organizations that own large datacenters and offer their computing resources. Undoubtedly cloud computing provides tremendous benefits for data storage backup and data accessibility at a reasonable cost. This paper aims at surveying and analyzing the previous works proposed for disaster recovery in cloud computing. The discussion concentrates on investigating the positive aspects and the limitations of each proposal. Also examined are discussed the current challenges in handling data recovery in the cloud context and the impact of data backup plan on maintaining the data in the event of natural disasters. A summary of the leading research work is provided outlining their weaknesses and limitations in the area of disaster recovery in the cloud computing environment. An in-depth discussion of the current and future trends research in the area of disaster recovery in cloud computing is also offered. Several work research directions that ought to be explored are pointed out as well, which may help researchers to discover and further investigate those problems related to disaster recovery in the cloud environment that have remained unresolved

    Skyline queries computation on crowdsourced- enabled incomplete database

    Get PDF
    Data incompleteness becomes a frequent phenomenon in a large number of contemporary database applications such as web autonomous databases, big data, and crowd-sourced databases. Processing skyline queries over incomplete databases impose a number of challenges that negatively influence processing the skyline queries. Most importantly, the skylines derived from incomplete databases are also incomplete in which some values are missing. Retrieving skylines with missing values is undesirable, particularly, for recommendation and decision-making systems. Furthermore, running skyline queries on a database with incomplete data raises a number of issues influence processing skyline queries such as losing the transitivity property of the skyline technique and cyclic dominance between the tuples. The issue of estimating the missing values of skylines has been discussed and examined in the database literature. Most recently, several studies have suggested exploiting the crowd-sourced databases in order to estimate the missing values by generating plausible values using the crowd. Crowd-sourced databases have proved to be a powerful solution to perform user-given tasks by integrating human intelligence and experience to process the tasks. However, task processing using crowd-sourced incurs additional monetary cost and increases the time latency. Also, it is not always possible to produce a satisfactory result that meets the user's preferences. This paper proposes an approach for estimating the missing values of the skylines by first exploiting the available data and utilizes the implicit relationships between the attributes in order to impute the missing values of the skylines. This process aims at reducing the number of values to be estimated using the crowd when local estimation is inappropriate. Intensive experiments on both synthetic and real datasets have been accomplished. The experimental results have proven that the proposed approach for estimating the missing values of the skylines over crowd-sourced enabled incomplete databases is scalable and outperforms the other existing approaches

    An enhanced group mobility management method in wireless body area networks

    Get PDF
    Mobility management of wireless body area networks (WBANs) is an emerging key element in the healthcare system. The remote sensor nodes of WBAN are usually deployed on subjectsโ€™ body. Certain proxy mobile IPv6 (PMIP) methods have been recommended, however, PMIP is relatively impractical in group mobility management pertaining to WBAN. It is likely to cause enormous registration and handover interruptions. This paper presents an approach aims at overcome these limitations using improved group mobility management method. The method emphasizes on incorporation of authentication, authorization, and accounting (AAA) service into the local mobility anchor (LMA) as an alternative to independent practice. Furthermore, proxy binding update (PBU) and AAA inquiry messages are merged. Additionally, AAA response and proxy binding acknowledge (PBA) message are combined. The experiment results demonstrate that the proposed method outperforms the existing PMIP methods in terms of delay time for registration, the handover interruptions and the average signaling cost

    Estimating missing values of skylines in incomplete database

    No full text
    Incompleteness of data is a common problem in many databases including web heterogeneous databases, multi-relational databases, spatial and temporal databases and data integration. The incompleteness of data introduces challenges in processing queries as providing accurate results that best meet the query conditions over incomplete database is not a trivial task. Several techniques have been proposed to process queries in incomplete database. Some of these techniques retrieve the query results based on the existing values rather than estimating the missing values. Such techniques are undesirable in many cases as the dimensions with missing values might be the important dimensions of the userโ€™s query. Besides, the output is incomplete and might not satisfy the user preferences. In this paper we propose an approach that estimates missing values in skylines to guide users in selecting the most appropriate skylines from the several candidate skylines. The approach utilizes the concept of mining attribute correlations to generate an Approximate Functional Dependencies (AFDs) that captured the relationships between the dimensions. Besides, identifying the strength of probability correlations to estimate the values. Then, the skylines with estimated values are ranked. By doing so, we ensure that the retrieved skylines are in the order of their estimated precision

    Minimum completion time for power-aware scheduling in cloud computing

    No full text
    Reducing power consumption has been an essential requirement for Cloud resource providers not only to decrease operating costs, but also to improve the system reliability. This paper tackles the problem of minimizing power consuming in datacenters hosts and improving their load balancing simultaneously. Cloud computing based on the idea of offering services going to be executed on datacenters. These datacenters need huge amount of power if they are in the peak load or the tasks are not distributed efficiently in their machines. The paper presents an algorithm for task scheduling that lowering the power consuming and reducing the total datacenter load. An empirical study has been done to simulate the proposed algorithm, which is proved by the results

    Skyline query processing for incomplete data in cloud environment

    No full text
    Many research works have been conducted focusing on pro-cessing skyline queries on databases. Recently, some approaches have been proposed to address the issue of skyline queries for a partially complete da-tabase in which data item values might not be presented (missing). Howev-er, these approaches are tailored for centralized database and accessed only one table to identify the skylines. Nevertheless, in many contemporary data-base applications, this is might not be the case, particularly for a database with incomplete data and many tables spread over various remote locations such as cloud environment. Applying skyline approaches designed for cen-tralized database directly on cloud databases is undesirable due to the pro-hibitive cost of transferring the amount of data from one datacenter to an-other during skyline process. An approach is needed taking into considera-tion the unique features of cloud environment when processing skyline que-ries on a database with incomplete data. This paper proposes an approach that evaluates skyline queries in a database with partially incomplete data over the cloud. The approach aims at reducing the number of pairwise com-parisons that needs to be conducted between data items and the amount of data transferred in identifying skylines. Several experiments over synthetic and real datasets have been conducted to evaluate the performance of our approach. The result shows that our approach outperforms the previous ap-proach in terms of a number of pairwise comparisons and amount of data transferred

    An empirical comparative study of instance-based schema matching

    No full text
    The main issue concern of schema matching is how to support the merging decision by providing matching between attributes of different schemas. There have been many works in the literature toward utilizing database instances to detect the correspondence between attributes. Most of these previous works aim at improving the match accuracy. We observed that no technique managed to provide an accurate matching for different types of data. In other words, some of the techniques treat numeric values as strings. Similarly, other techniques process textual instance, as numeric, and this negatively influences the process of discovering the match and compromising the matching result. Thus, a practical comparative study between syntactic and semantic techniques is needed. The study emphasizes on analyzing these techniques to determine the strengths and weaknesses of each technique. This paper aims at comparing two different instance-based matching techniques, namely: (i) regular expression and (ii) Google similarity to identify the match between attributes. Several analyses have been conducted on real and synthetic data sets to evaluate the performance of these techniques with respect to Precision (P), Recall (R) and F-Measure
    corecore